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Abstract 

An algebraic tree T is one determined by a finite system of fixed 
point equations. The frontier Fr(T') of an algebraic tree T is linearly 
ordered by the lexicographic order <i. If (Fr(T) <e) is well-ordered, its 
order type is an algebraic ordinal. We prove that the algebraic ordinals 
are exactly the ordinals less than u" . 



1 Introduction 

Fixed points and finite systems of fixed point equations occur in just about all 
areas of computer science. Regular and context-free languages, rational and 
algebraic formal power series, finite state process behaviors can all be charac- 
terized as (components of) canonical solutions (e.g., unique, least or greatest, 
or initial solutions) of systems of fixed point equations. 

Consider the fixed point equation 



X = 1 + X 



over linear orders, where + denotes the sum operation (functor) on linear orders. 
As explained in |BE10| . its canonical (initial) solution is the ordinal cj, or any 
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linear order isomorphic to the ordering of the natural numbers. For another 
example, consider the system of fixed point equations 

X = Y + X 
Y = l + Y 

The first component of its canonical solution is w 2 , and the second component 
is u). Of course, there exist fixed point equations whose canonical solution is not 
well-ordered, for example, the canonical solution of 

X = X + l 

is u>*, the reverse of uj, and the canonical solution of 

X = X + 1 + X 

is the ordered set of the rationals. 

The above equations are quite simple since they involve no parameters. The 
unknowns X, Y range over linear orders, or equivalently, constant functions (or 
rather, functors) defined on linear orders. By allowing unknowns ranging over 
functions (or functors) in several variables, we obtain the ordinal w u as the first 
component of the canonical solution of 

F = G(l) 
G{x) = x + G(F(x)) 
F(x) = x + F(x) 

The second and third components of the canonical solution are functors which 
map a linear order x to x x lo 2 and x x u), respectively. 

We call a linear order algebraic if it is isomorphic to the first (or principal) 
component of the canonical solution of a system of fixed point equations of the 
sort 

Fi(x ,...,x ni ^i) = U, i=l,...,n (1) 

where ri\ — and each ti is an expression composed of the function variables 
Fj, j — 1, . . . , n, the individual variables Xq, . . . , £ nj _i, the constant 1 and the 
sum operation +. Moreover, we call a linear order regular if it is isomorphic to 
the first component of the canonical solution of a system (fTJ with n.j = for all 
i. Further, we call an ordinal algebraic or regular if it is an algebraic or regular 
linear order. 

From the results in |Hei l80 . it follows easily that an ordinal is regular if and 
only if it is less than w u (See also |BC01] .) It was conjectured in [BE07] that 
an ordinal is algebraic if and only if it is less than . Our aim in this paper 
is to confirm this conjecture. 
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Infinite structures may be described by finite presentations in several different 
ways. One method, represented by automatic structures, consists in describing 
a structure up to isomorphism by representing its elements as words or trees 
or some other combinatorial objects, and its relations by rewriting rules or 
automata. Another, algebraic approach is describing an infinite structure as the 
canonical solution of a system of fixed point equations over a suitably defined 
algebra of structures. Our results show that these methods are equivalent, at 
least for small ordinals Q 

An automatic relational structure Hod82 , KN95 is a countable structure whose 
carrier is given by a regular language and whose relations can be computed by 
synchronous multi-tape automata. A tree automatic structure |DT901 IDel041 
ICol04) is defined using tree automata, cf. [GS84] . It was proved in |Del04] that 
the automatic ordinals are exactly the ordinals less than . See also [KRS03 . 
In |Del04] , it is also shown that an ordinal is tree automatic if and only if it is 
less than cj" . Thus, an ordinal is automatic if and only if it is reg and 
is tree automatic if and only if it is algebraic. Actually the claim that every 
regular ordinal is automatic is immediate. It would be interesting to derive a 
direct proof of the fact that every algebraic ordinal is tree automatic, or the 
other way around. 

In a traditional setting, one solves a system of fixed point equations in an algebra 
equipped with a suitable partial order; there is a least element, suprema of 
w-chains exist, the operations preserve the ordering and least upper bounds 
of w-chains. Such algebras are commonly called continuous algebras (or uj- 
continuous algebras), cf. |GTWW77l IGue8l] . In this setting, one solution of 
this kind of system is provided by least fixed points. The classical Mezei- Wright 
theorem |MW67j asserts that such a solution is preserved by a continuous, order 
preserving algebra homomorphism. 

However, in several settings such as (countable) linear orders, there is no well- 
defined partial order but one can naturally introduce a category by considering 
morphisms between linear orders. A generalization of the classical Mezei- Wright 
theorem to the setting of "continuous categorical algebras" has been given in 
|BE10| . where least elements are replaced by initial elements, and suprema of 
w-chains are replaced by colimits of w-diagrams. Since trees, equipped with 
the usual partial order GTWW77I IGue811 IBElOj form an initial continuous 
categorical algebra, it follows that instead of solving a system of fixed point 
equations directly over linear orders, we may first find its least solution over 
trees, to obtain an algebraic or regular tree, and then take the image of this 
tree with respect to the unique morphism from trees to linear orders which 
assigns to a tree the linear order determined by the frontier of the tree. Thus, 
up to isomorphism, an algebraic (or regular) linear order is the frontier of an 
algebraic (or regular) tree. In this way, we may represent algebraic and regular 
linear orders and ordinals as frontiers of algebraic or regular trees, and this is 

1 For another result in this direction, see [Col04 . 
2 Compare this fact with Theorem 2 in [BKN08 . 
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the approach we take here. This approach is not new. Courcelle was the first 
to use frontiers of trees to represent linear orders and words, i.e., labeled linear 
orders. The origins of the notions of regular linear order and regular word go 
back to |Cour78a] . 

In our argument showing that every algebraic ordinal is less than uj u , we will 
make use of certain context-free grammars generating prefix languages, called 
"ordinal grammars" which seem to have independent interest. Equipped with 
the lexicographic order (see below) , the language generated by an ordinal gram- 
mar is well-ordered. We show that an ordinal is the order type of a language 
generated by an ordinal grammar if and only if it is less than w w . We then 
show how to translate an algebraic tree given by a system of equations to an 
ordinal grammar generating the frontier of the tree. 

The paper is organized as follows. In Section [2] we define regular and algebraic 
trees and linear orders, and regular and algebraic ordinals. Then, in Section [3l 
we establish some closure properties of algebraic linear orders, including closure 
under sum, multiplication, and w-power, and use these closure properties to 
establish that any ordinal less than w a is algebraic. Section @] is devoted to 
ordinal grammars and the proof of the result that an ordinal is the order type 
of the context-free language generated by an ordinal grammar, equipped with 
the lexicographic order, if and only if it is less than w w . Then, in Section [5] 
we show how to construct for an algebraic tree T (given by a system of fixed 
point equations) an ordinal grammar G such that the order type of the language 
generated by G equals the order type of the frontier of T. (The proof of the 
correctness of the translations is moved to an appendix.) The paper ends with 
Section [6] containing some concluding remarks. 

2 Linear orders, words, prefix languages, and 
trees 

2.1 Linear orders 

A linearly ordered set (A, <a) consists of a set A equipped with a strict linear 
order, i.e., an irreflexive and transitive binary relation <a that satisfies exactly 
one of the conditions 

x = y or x < A y or y < A x, 

for all x, y € A. If (A, <a) and (B, <g) are linearly ordered sets, a morphism 
h : (A, <a) — > (B, < B ) is a function h : A — > B such that for all x, y £ A, if x <a 
y then h(x) <b h(y). The collection of all linearly ordered sets and morphisms 
forms a category LO. We say two linearly ordered sets are isomorphic if they are 
isomorphic in LO. The order type of a linear order is the isomorphism class 
of the linear order in LO. We write o(A, <a) for the order type of (A, <a)- 
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A linearly ordered set (A, <a) is well-ordered if every nonempty subset of A 
has a least element. If a is an ordinal, then a is well-ordered by the membership 
relation £. When [A, <a) is well-ordered, there is a unique ordinal a such that 
(a, g) is isomorphic to (A, <a), and the order type of (A, <a) is conveniently 
identified with this ordinal. 

Remark. We remind the reader that if a — o(A, <a) and /3 = o(B, <b) are 
ordinals, where A, B are disjoint sets, then a + f3 is the order type of A U B, 
ordered by putting every element of A before every element of B; otherwise, 
imposing the given ordering on elements inside A or B. Similarly, if a n — 
o(A n , <„) is an ordinal for each n > 0, where the sets A n are pairwise disjoint, 
then J2 n a « is the order type of the union (J n A n ordered by making every 
element of A n less than every element in A m , for n < m, and imposing the 
given order on A n . 

The ordinal a x /3 is the order type of Ax B, ordered by "last differences" , i.e., 
where 

(a, b) < (a', 6') b < B b' or (b = b' and a <a a'). 



Last, a" is the least upper bound of all ordinals a n — a x . . . x a, for n < uj. 

For more definitions and facts on linear orders and ordinals we refer to [Roit901 
IRos82[ ISier58j . For later use, we prove: 



Lemma 2.1 Suppose that a is an ordinal and /3 is an infinite ordinal, so that 
1 + P = /3. If a is a successor ordinal then 

((3 + l)xa < /3xa+l. 

If a is or a limit ordinal, then 

(P + l)xa < fix a. 



Proof. We prove both claims by transfmite induction on a. The case when 
a = is clear. Assume that a is a successor ordinal. Then a = 7 + n for some 
positive integer n where 7 is or a limit ordinal. Then, using the induction 
hypothesis and the assumption 1 + /3 = j3, 

09 + l)xa - 08 + 1) x 7+03 + 1) xn 

< (3 x j + f3 x n + 1 

= /3 x (7 + n) + 1 

= /3 x a + 1. 
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Suppose now that a is a limit ordinal. Then, 

(/3 + l)xa = sup((^ + l)x 7 ) 

< sup(/3 X 7 + 1) 

< P X a, 

again by the induction hypothesis. □ 

2.2 Words 

We use N for the set of nonnegative integers. If n € N, then 

[n] = {0,...,n-l}, 

so that [0] = 0. The collection of all finite words on a set A is denoted A* . The 
empty word is denoted e. The set of nonempty words on A is A + — A* \ {e}. We 
denote the product (concatenation) of two words u,v E A* by uv. We identify 
an element a e A with the corresponding word of length one. 

When A is linearly ordered by <a, then A* is equipped with two partial orders. 
The prefix order, written u < p v, is defined by: 

u < p v v = uw, w / e, 

for some word w. The strict or branching order, written u < s v is defined 
by: u < s v if and only if for some words ui,u 2 , v 2 , and a,b E A, 

u = u\au2 

v = uibv2, and 

a <a b. 

The lexicographic order on A* is defined by: 

u <£ v -^==^> u < p v or u < s v. 
It is clear that for any u, v G A* , exactly one of the following possibilities holds: 

u = v, u < p v, v < p u, u < s v, v < s u. 
Thus, the lexicographic order is a linear order on A* . 

2.3 Prefix languages 

A prefix language on a set A is a subset L of A* such that if u € L and uv € L 
then v = e. For later use, we state the following facts about prefix languages. 
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Lemma 2.2 • If(P, <p) is a countable linearly ordered set, there is a prefix 
language L on [2] such that (P, <p) and (L, <e) are isomorphic. 

• If L is a prefix language, then, for u,v G L, u <i v u < s v. 

• If K, L C A* are prefix languages, where A is linearly ordered, so is KL = 
{uv : u G K,v G L} and, when (K, <e) and (L, <£) are well-ordered, 

o(KL,<i) = o(L, <£) x o(K, <i). 

• Suppose that L C [2]* is a prefix language such that a — o{L, <i) is an 
ordinal. Then 

((J l n OL n ,<e) 

n>0 

is well-ordered and, if a > 1, 

a" = o(|J l n QL n ,<£). (2) 

Proof. For the first statement we refer to |Cour78al IBE04] . We prove only the 
statement (|2|). The set, say L^, of words on the right side of ([2]) is the set of 
all words of the form \ n Qu, for u e L n . For any words u, v, if l n 0u < p l m Qv, 
then n = m and u < p v. Since L is a prefix language, so is L n and, thus so 
is U„ 1"0L". Also, if n < m, l"0u < s l m 0v, and l n 0u < s l n 0v if and only if 
u < s v. Thus, by definition, the order type of the lexicographic order of the 
prefix language L m is J2n a ™- ^ ^ s now eas y *o see that this sum is a". □ 

2.4 Trees 

Suppose that E is a (finite) ranked alphabet, i.e., a nonempty finite set parti- 
tioned into subsets of "fe-ary operation symbols", k > 0. Moreover, suppose 
that V — {xq, Xi, . . .} is an ordered set of "individual variables". A E-tree T 
in the variables V, or a "tree over £ in the individual variables V" , is a partial 
function [n]* — > E U V satisfying the conditions listed below. Here, n is largest 
such that E n ^ 0. The conditions are: 

• The domain of T, dom(T), is prefix closed: if T(uv) is defined, then so is 
T(u). 

• If T(u) G E fe , k > 0, and T(ui) is defined, then i G [k\. 

• If T(u) G Eg U V, then u is a leaf, and T(ui) is undefined, for all i. 
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A E-tree T is complete if whenever T(u) is defined in E n , for some u, then 
T(uO), . . . ,T(u(n — 1)) are all defined. Moreover, T is finite if its domain is 
finite. Below we will usually denote finite trees by lower case letters. The size of 
a finite tree t is the size of the set dom(£). The set of all E-trees in the variables 
V is denoted Tg(V). Moreover, for a subset V n = {xq, . . . , x„_i} of V, we write 
Tg(V n ) for the collection of all trees all whose leaves are labeled in E U V n . 
When n = 0, we write simply T£. 

Trees are equipped with the following partial order T C T" : Given T, T' £ 
Tg(V) such that T ^ T", we define T C T' if and only if for all words u, if 
T(m) is defined, then T(u) = T'(u). It is well-known that the partially ordered 
set (Tg(y),c) is w-complete, i.e., Tg(V) has as least element _!_, the totally 
undefined tree, and least upper bounds of all w-chains. Indeed, if To C T\ d . . . 
is an w-chain in Tg(V), the supremum is the tree T whose domain is the union 
of the domains of the trees T n , and, if u is a word in this union, T(u) = a £ S 
if and only if T n (u) = a, for some n. Similarly, T(u) = Wj for some w £ V 
if and only if T n (u) = Vi for some n. Each symbol a £ induces a A:-ary 
operation on Tg(V) in the usual way. It is well-known that these operations 
are w-continuous in all arguments and T£(V) is an w-continuous algebra. In 
the same way, for every n, T£(V n ) is an w-continuous algebra (in fact, the free 
w-continuous E-algebra on V n ). See [GTWW771 [Gue8l] . 

2.5 Tree substitution 

In this section we define a substitution operation on trees, sometimes called 
second-order substitution. 

Suppose that E and A are ranked alphabets and for each a £ E„ we are given a 
tree R a £ Tg(V n )- We define substitution in two steps, first for finite trees, and 
by continuity for infinite trees. For each finite tree t £ Ts(V) we define the tree 
S = t[a M> -Rcrjcres in TJ£(V), sometimes denoted just t\a h4 R a ] by induction 
on the size of t. When t is the empty tree _L, so is S. When t is x, for some 
x £ V, then S = x. Otherwise t is of the form a(to, . . . ,£„_i), where a £ E„, 
and we define 

S = R a (t[, ■■■,t' n ) 
where t' i = ti[a !->• R a ], for all i. 

Suppose now that T is an infinite tree in Ts(V). Then there is an ascending 
w-chain (t n ) of finite trees such that T — sup n i ra . We define 

T[a i — y Ra] = s\vpt n [a H> R a ). 

n 

It is known, see |Cour83] . that substitution is w-continuous. 
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Proposition 2.3 Substitution is a continuous function 
T£{V) xJlmVnf" -+T£{V). 

n 

Below when E and A are not disjoint and R a = cr(xi, . . . , x n ) for some a G E n , 
then we often omit a from the arguments of the substitution. 

2.6 Algebraic trees and ordinals 

Now consider a finite system E of equations of the form 

F i (a;o,...,x„ 1 _i) = ti(x , . . . , x ni -i) (3) 
F 2 (x , . . . ,x n2 -i) = t 2 (x , ■ ■ ■ ,x„ 2 _i) 

(^0 : ■ • • : *^n m — 1 ) ^rn (*E0 : • • ■ : 

where, for i = 1, . . . , m, U is a term over the ranked alphabet E U T in the 
variables {xq, . . . , x ni -i} (i.e., finite complete tree in Tg(V ni )), where T = 
{Fx, . . . , F m } is the set of "function variables" and each Fi has rank rij. See the 
example in ((4]) below. Each term ti induces a function 

tf : Tg(V ni ) x ... x Tg(V„ m ) -> T£{V ni ) 

by substitution: 

(i?l, . . . , R m ) H> U[Fj h-> -Rj]l<j<mH 

By Proposition 12.31 this function is w-continuous. The target tupling 

mapping Tg(V ni ) x . . . x Tg(V nm ) to itself is also w-continuous and has a least 
fixed point (Ti,...,T m ). Thus, there is a least solution (Ti,...,T m ) of any 
such system. One function variable Fi of rank is selected as the principal 
variable, and the corresponding tree Ti is the principal component of the 
least solution. (Typically, we choose the first function variable as the principal 
variable.) If every integer ni, i = 1, . . . ,m, is zero, the system is said to be 
regular. 

Definition 2.4 A trecT inTg(Vk) is algebraic inT£(Vk), (respectively, reg- 
ular), if there is a finite system E , (respectively, regular system), of equations 
as above such that T is the principal component of the least solution of E in 
T£(V k ). 

3 It is understood here that each a S Sfe remains unchanged, i.e., gets substituted by 
a(x ,---,x k - 1 ). 
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An alternative definition is possible by interpreting finite systems of fixed point 
equations directly on the "continuous categorical algebra" of linear orders. See 
[BE 10] . 

We will use the above definition mainly when k — 0. It is known that when 
T £ Tg(V n ) and n < m, then T is algebraic in T£(V n ) if and only if it is 
algebraic in T^(V m )- Thus, we may simply call T just algebraic. Moreover, we 
say that a tree T £ T£(V) is algebraic if it is algebraic in T^(V n ), for some 
n > 0. It is also known that when £ C then a tree T £ (V) is algebraic 
if and only if T is algebraic in T£,(V). Similar facts and conventions hold for 
regular trees. So below we can simply say that a tree is algebraic, or regular 
without specifying exactly the ranked alphabet. 

The set of leaves of a tree T in Tg(V) is denoted 

Fr(T) = {«e[n]':r(«)eL uy}, 

and is called the frontier of T. Fr(T) is a prefix language linearly ordered by 
<i. Here, n is the maximum of the ranks of the symbols in S. 

Definition 2.5 A linear order is algebraic (respectively, regular) if it isomor- 
phic to (Fr(T),<i) for some algebraic (respectively, regular) tree. An algebraic 
or regular ordinal is an ordinal which is an algebraic or regular linear order. 

Heilbrunner [Heil80 proved that the frontiers of regular trees are those obtain- 
able from the empty and one point frontiers by the operations of "concatenation, 
omega and omega-op powers" , and infinitely many "shuffle" operations. It is 
an easy corollary of this fact that the regular ordinals are those less than uf, 
(There is a somewhat longer argument based only on the facts in |BC01j .) In 
this paper we will prove that an ordinal is algebraic if and only if it is less than 

UJ U ". 

The following fact is known. See |Cour83| . 

Proposition 2.6 The classes of algebraic and regular trees are closed under 
first- and second-order substitution. 

In the remaining part of this section we show that for algebraic and regular 
linear orders one may restrict attention to those algebraic or regular S-trees for 
which £„ = unless n — 2 or n = 0. 

Let A be the ranked alphabet with one binary function symbol and one constant 
symbol; otherwise, A„ = 0. 

Proposition 2.7 For any ranked alphabet S and any algebraic tree T £ T£(V) 
there is an algebraic tree T' £ T£ such that (Fr(T), <i) and (Fr(T'), <e) are 
isomorphic. □ 
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For example, consider the system E of equations 

F = <n(a,b,Fi(a)) 
F x (x) = F 2 {x,x) (4) 
F 2 (x,y) = a 1 (a 2 (a), F 2 (x, F 2 (x,y)), y) 

which uses a function symbol o\ in S3. The least solution consists of three trees 
(Tq,Ti,T 2 ) having vertices of out-degree 3. We replace the system E by the 
system 

Fo = g{a, g(a,Fi(a))) 
F x (z) = F 2 (x,x) 
F 2 (x,y) = g(a, g{F 2 (x, F 2 (x,y)), y)) 

in which the right hand terms use only the function variables and the one binary 
function symbol g, and the one constant symbol a. If (T^T-^X^) is the least 
solution of this second system, (Fr(Tj),<^) is isomorphic to (Fr(T/), -Q ), for 
i= 1,2,3. 

Thus, from now on, we will assume that if T is an algebraic tree, then Fr(T) C 
[2]*. 

Example 2.8 Let E contain the binary symbol g, the unary symbol f and the 
constant a. Consider the system 

F = F(a) 
F(x) = g(x,F(f(x))) 

Then the first component of the least solution of this system is the tree 

To = g(a,g(f(a),g(f(f(a)), . . .,g(f n (a), . . .)))). 

Thus, this tree is algebraic. See also lCour78b] I Cour83[ I Gue81f . 

3 Closure properties of algebraic ordinals 

In this section we use certain closure properties of algebraic ordinals to prove 
that every ordinal less than uj u is algebraic. 

Proposition 3.1 Let C be any set of ordinals which contains 0,1, and is closed 
under sum, product and uj-power: i.e., if a, (3 € C, then a + (3, ax j3, a" belong 
to C. Then all ordinals less than oj" belong to C. 
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Proof. This follows from the assumptions and induction, making use of the 
Cantor Normal Form |Roit90[ l Ro s82 , Sier58j for ordinals less than w u . In fact, 
it is known that the set of ordinals less than uj u is the least set of ordinals 
containing 0, 1 which is closed under sum, product, and w-power. □ 

We use Proposition ^. II to show all ordinals less than u u are algebraic. For this 
reason, we fix the ranked alphabet A containing only a binary symbol g and a 
unary symbol a. We show that all ordinals less than ui u arise as frontiers of 
algebraic trees in T£. 

The ordinal is algebraic, since if T is the empty tree, then Fr(T) is the empty 
language, and the order type of the empty language is 0. The ordinal 1 is 
algebraic since the one-point tree is algebraic. 

Suppose that a — o(Fr(T), <e) and (3 — o(Fr(5), <i), where T and S are alge- 
braic trees in T£. 

Proposition 3.2 If a and /3 are algebraic ordinals, so is a + (3. 

Proof. Consider the algebraic tree g(T, S) = g(a, b)[a H» T \b H» S] whose root 
is labeled by the function symbol g and whose left subtree is T and whose right 
subtree is S. Then the tree g(T, S) is algebraic, and its frontier has order type 
a + /3. □ 

Since and 1 are algebraic, we have the easy corollary that every finite ordinal 
is algebraic. 

Proposition 3.3 If a and (3 are algebraic ordinals, so is a x /3. 

Proof. The tree S[a h4 T] is algebraic, and its frontier has order type a x f3. 

□ 



Proposition 3.4 If a is an algebraic ordinal, so is a". 

Proof. Suppose that a — o(Fr(T), <t) for an algebraic tree T in T£. Consider 
the tree T of Example 12.81 and let S be the algebraic tree in T£ obtained by 
substituting the tree T for each vertex labeled /: S = Tq[J i-t T]. Then S is 
algebraic and its frontier is of order type a u . □ 

Corollary 3.5 Every ordinal less than w u is algebraic. 

Proof. By Propositions 13.21 13.31 and 13.41 together with Proposition 13.11 □ 
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4 Grammars 



In our argument proving that all algebraic ordinals are less than w w we will 
use certain context-free grammars, called ordinal grammars. 

Throughout this section, we assume that 

G=(N,T, S,P) 

is a context-free grammar, with nonterminals N, start symbol S G N, terminals 
T = {0, 1}, and productions P. 

We will denote finite words on the alphabet {0, 1} by u,v, w,x,y, . . .; nonter- 
minals will be written X, Y, Z, . . ., and we will denote by p, q, r, s . . . words on 
NUT, possibly containing nonterminals. 

Further, we assume that each context-free grammar has the following properties: 

• Either each nonterminal X <E N is "coaccessible" , i.e., L(X) is a nonempty 
subset of {0, 1}*, where L(X) is the collection of all words u € {0, 1}* such 
that there is some derivation 

X =^> u, 

or N = {S} and P is empty. We write L(G) for L(S). 

• Each nonterminal X is "accessible", i.e., there is some derivation 

S =^> qXr 

where q, r e (NUT)*. 

We end this section with some classical definitions. Suppose X, Y are nonter- 
minals. Write 

Y < X 

if there is some derivation X =?=> pYq for some p and q. Define X«Fif both 
X ^ Y and Y ^ X hold. When Y ■< X but X w Y docs not hold, we write 
Y <X. 

The relation -< is a preorder on the nonterminals, and induces a partial order 
< on the equivalence classes 

[X] := {F:X«y}, 

where [Y] < [X] if Y ^ X. 

We say X is a recursive nonterminal if there is some nontrivial derivation 

X ^ pXq, (5) 
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for some words p, q £ (N U T)*. If not, X is a non-recursive nonterminal. 

When X is non-recursive, and X — > p is a rule, Y" -< X , for all nonterminals in 
P- 

Definition 4.1 77ie height of a nonterminal X is the number of equivalence 
classes [Y] strictly below [X] . If q is a finite word on N U T , the height of q, 
ht(<7), is the maximum of the heights of the nonterminals occurring in q. If q 
has no nonterminals, ht(g) = — 1. 

If there are k nonterminals, ht(X) < k, for all nonterminals X. If X has height 
zero, and Y -< X, then Y » X. 

For any word q in (N U T)*, write L(q) for all words in {0, 1}* derivable in G 
from q. 

4.1 Prefix and ordinal grammars 

Our definition of an ordinal grammar is motivated by Proposition 14.31 below. 
But hrst we need the following fact: 

Lemma 4.2 Suppose L C {0, 1}* is any language. If (L, <e) is not well- 
ordered, then there is a countable descending chain (u n ), n = 0, 1, . . ., of words 
in L such that 

u n+ i < s u n , 

for each n > 0. 

Proof. Suppose that (v n ) is a countable <£-descending chain of words in L. 
Then, for each n, either v n+ i < p v n or v n +i < s v n . Now define uq — vq. Since 
has only finitely many prefixes, there is a least k such that u/c+i <s Vk <p 
. . . < p vq. Then u\ = Vk+i < s uq, since u < s v if u < s w and w < p v, for any 
words u, v, w. Similarly, assuming that u m has been defined as v m > , for some 
m', we may define u m +i as the first Vk such that k > m! and Vk < s u m . □ 

We take note of the following inheritance property. 

Proposition 4.3 If (L(G), <e) is well-ordered, then, for any nonterminal X , 
(L(X), <i) is also well-ordered. 

Proof. If not, by Lemma F4.2[ suppose that there is a countable chain 

. . . < s Ui < s M - 
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of words in L(X). Thus, for each n, u n+ i = x n 0y n and u n — x n lz n , for some 
words x n , y n , z n G {0, 1}*. Since all nonterminals are accessible and coaccessible, 
there are words v, w such that 



for each i > 0, and thus 

VU n+ iW < s vu n w, 
for each n > 0, showing (L{G),<t) is not well-ordered, contradicting the hy- 



Definition 4.4 A grammar G is a prefix grammar if, for each nonterminal 
X , L{X) is a prefix language. An ordinal grammar is a prefix grammar such 
that (L(G), <e) is well-ordered. 

If G = (N,T, P, S) is an ordinal grammar, then by Proposition 14.31 (L(X), <( 
) = (L(X), < s ) is well-ordered, for all X e N. 

The following fact is immediate from Lemma \2. 21 and the definitions. 

Lemma 4.5 If G is a prefix grammar, then for any word q = VoXiVi . . . Vk-iXkVk G 
(N U T)* , L(q) is a prefix language with (L(q), <() — (L(q). < s ) and 



Moreover, if G is an ordinal grammar, then for any word q 6 (NUT)* , (L(q), <i 
) is well-ordered. 

We write o(q) for the order type of (L(q), <t). In particular, o(X) denotes the 
order type of the linear order (L(X), <i) = (L(X), < s ). 

We list some examples of ordinal grammars. Each grammar includes nontermi- 
nals from the set 



vuiW G L{G), 



pothesis. 



□ 



o(L(q), < e ) = o(L(X k ), <t) X . . . X o(L(X{), <t). 




1. UJ 



n 1 -> o + ifii. 



1*0, so that o(Oi) 



n 2 ->• oOi + in 2 . 



L(n a ) 



Ufc,n lfc 01"0 = L(n) 2 , so that o(fi 2 ) 



,2 
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3. uj n+1 

o(n„+i)=o(0 1 )"+ 1 . 

4. cj" 



f2? ->• + lfi?fii. 



L(Qp ) = U„ l"0i(Oi)" = |J„ l"0(l*0) n , so that o(f^) = w a 



5. 

-»• o + in£ +1 o£ 

Note that the first three are regular grammars, but the subsequent ones are 
context-free ordinal grammars. 

We now establish some closure properties of the ordinals of ordinal grammars. 

Proposition 4.6 The set of ordinals o(L(G)), for an ordinal grammar G, is 
closed under sum, products and uj -powers. 

Proof. Suppose that Gi are ordinal grammars, for i = 1,2. Then, the grammar 
with a new start symbol S + and productions 

S+ -> OSi + 152 

together with the productions of G\ and Gi is a grammar satisfying 

o(S+) = o(L(G 1 ))+o(L(G 2 )). 

If, instead, we add the new start S x and the production 

S x S2S1 

to the productions of G\ and G2, we obtain a grammar satisfying 

o(S x ) = o(L(Gi))xo(L(G a )). 
Last, if we add the new start symbol S u and the productions 

S* w -> + lS LJ S 1 
to the productions of G\, we obtain a grammar satisfying 

o(S u ) = o(L(Gi)) u . 
In each case, the constructed grammar is an ordinal grammar. □ 
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Proposition 4.7 Any ordinal less than oj^ is the ordinal o(L(G)), for some 
ordinal grammar G. 

Proof. From Proposition 14.61 and Proposition 13.11 using the fact that every 
finite ordinal is the ordinal of an ordinal grammar. □ 

Remark 4.8 The above proposition also follows from the corresponding result 
for algebraic ordinals and the facts in Section^ 

4.2 A bound on ordinals of ordinal grammars 

In this section, we will show that for any ordinal grammar G, o(L(G)) is less 
than u w " . 

Throughout this section, we assume that G is an ordinal grammar and L(G) 
is infinite. Moreover, we assume that L(X) contains at least two words, for 
each nonterminal X. Note that for each nonterminal X, since L(X) is a prefix 
language, it does not contain the empty word. 

Proposition 4.9 If X q for some X £ N and q = vqXiVi . . . Vk-iXkVk € 

(NUT)* then o(q) = o(X k ) x . . . x o(Xi) < o(X). In particular, o(Xi) < o(X) 
for all i. 

Proof. Immediate from Lemma T4. 5 1 and the fact that L(q) C L(X). □ 
The relations ^ and ~ were defined at the beginning of Section 2] 

Corollary 4.10 IfY <X, then o(Y) < o(X). IfXxsY, o(X) = o(Y). 

Proof. Immediate, from Proposition 14.91 □ 

It is not necessarily true that if o(X) — o(Y) then X w Y. Consider the 
example: 

X 1 -»■ 1X 2 
X 2 -> IX 2 + 0. 

Then X 2 ~< X\, and o(Xi) = 0(^2) = cj, since 

l(x 2 ) - |J ro 
l{x x ) = \Jro. 

n>l 
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Proposition 4.11 For any nonterminal X , there is no derivation X => Xp 
with p =/= e . 

Proof. Indeed, suppose that X Xp. There are nonempty words it, v such 
that p ==>• v, and X u. Thus, X u and X =>■ ™, so that L(X) 

is not a prefix language. □ 

Proposition 4.12 For each derivation X =>- p, either Y -< X holds for all 
nonterminals Y in p, or there is exactly one nonterminal Y occurring in p with 
X « Y , and in this case, p = uYq, for some u S {0, 1}* and for some q with 
ht(q) < ht(X). 



Proof. Suppose that X => qYrZs is a derivation, where Y,Z ~ X. Let 
a = o(X), P = o(Y), 7 = o(Z). Then by Corollary gTUl a = /? = 7, and thus 
by Proposition 14. 9[ a x a < o(qYrZs) < a. This contradicts the assumption 
that L(X) contains at least two words. Similarly, if there are nonterminals 
Y,Z such that X qYrZs with ht(Y) < ht(X) = ht(Z), then o(X) > 

o(X) x o(Z) > o(X) x 2, an impossibility, since o(X) > 2. □ 

The next fact is a basic result. 



Lemma 4.13 Suppose X uXp and X => vXq. If \u\ < |u|, iften 

u < p w. In particular, if \u\ — \v\, then u = v. 

Proof. Assume u < s v, say. Suppose p ==> w and q w' . Then, for each 

n, X u n X w n , so that X =^> u n v z w' w n = y n , where z is any word in 
{0, 1}* with X =>• z. But, y n +i < s Um for each n, contradicting the fact that 
(L(X), <e) is well-ordered. □ 

Lemma 4.14 Suppose there is a derivation X ===> uXp. Suppose also that 
X v. Then either v < s u or u < p v. 



Proof. For any two words u, v there are four possibilities: 

U < s V, V < p U, V < s U, U < p V. 

We show that the first two possibilities are ruled out. 

If u < s v, then, for any n > 1, u n+1 vw n+l < s u n vw n , where w is any terminal 
word with p => w, so that there is a descending chain in L(X). 

If v < p u, then since o/t, » < p uvw, for any word w, so that L(X) is not a 
prefix language. □ 

Recall that a primitive word is a nonempty word w which cannot be written 
as u n , for any word u and integer n > 1. The primitive root |Lot97j of a 
nonempty word d is a primitive word u such that v — u n , for some n > 1. 



18 



Proposition 4.15 Suppose that X is a recursive nonterminal. Then there is a 
unique shortest word Uq £ {0, 1} + such that whenever X ==> uXp for some 
u G {0, 1} + and p G (N U T)* , then u is a power of . 

Proof. Consider any derivation X vXq with v G {0, 1} + and q G (NUT)*, 
let Uq denote the primitive root of v. Thus, itff is the shortest word such that 
v is a power of Uq , and clearly, Uq is primitive. If X uXp then there are 
some m,n> 1 such that \u m \ — \v n \. But then, by Lemma T4.131 



It then follows that u, v are powers of the same word (see |Lot97j for example), 
which implies that u also is a power of Uq . □ 

Below we will write uq for Uq whenever X is clear from the context. 

Proposition 4.16 Suppose that X is recursive and v G L(X). Then, there is 
some n > 1 such that v < s Uq. 

Proof. Indeed, there is a word u such that \v\ < \u\ and X ==^> uXp, for 
some p G (N U T)*. Then, v < s u, by Lemma T4.141 and u — Uq, for some n, by 
Proposition 14. 151 □ 

Thus, for v G L(X), if we choose n as the least integer such that v < s Uq, 
we may write v in a unique way as Uq -1 w where uq is not a prefix of w and 
w < s uq. Moreover, we can write w as xOy where xl is a prefix of Uq. 

Definition 4.17 Suppose that X is a recursive nonterminal, and the word xl 
is a prefix of uq. For each n > 0, define L(n,x,X) as the set of all words of 
the form u^xOy in L(X). Moreover, define L(n,X) = [J x L(n,x,X), where x 
ranges over all words such that xl is a prefix of Uq. 

Lemma 4.18 Suppose n < m. If v G L(n,X) and w G L(m,X), then v < s w. 

Proof. Write v = u^xOy and w — u^x'Oy', where xl and x'l are prefixes of 
uq. If uq = xlr, then 

v = UgxOy 

< s u a l (xlr) m - n x'0y' 
= u^x'Oy' = w. □ 

The following lemma gives an easy upper bound to the ordinal of a union. 
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Lemma 4.19 Suppose that L\,L2 are subsets of {0, 1}* such that for i = 1,2, 
(Li,<i) is well-ordered. Then (Li U L2, <i) is well-ordered. Let o(Li, <e) < cti, 
where on is a (infinite) limit ordinal, so that 1 + ati = oti, i = 1,2. Then 

o(Li U L2, <e) < max{«i x a 2 , cti x ct\\. 



Proof. To show L\ U L2 is well-ordered, suppose that (v n ) is an infinite 
descending chain in L\ U L2. Then, either there are infinitely many v n G Li, 
or infinitely many v n in L2- Either possibility contradicts the assumption that 
both Li and L 2 are well-ordered. 

Without loss of generality we may assume that (Li, <g) is cofinal in (L\\JL,2, <e), 
i.e., for each y S L 2 there is some x s L\ with y < x. For each y £ L 2 let f(y) 
denote the least x £ L% with y < x. For each x G L%, let j3(x) denote the order 
type of the set {1/ e L2 • f(y) = Then, using Lemma \2. II in the last line, 



o(LiU£ 2 ) < ^2(fi{x) + l) 

< + x ) 



— {0L2 + 1) X °L\ 

< as X a%. 

Thus, in this case, o(Li U L2) < 012 X ot\. □ 
By induction, we have: 



Corollary 4.20 For any finite collection {Li : i = 1, . . . ,n} of subsets o/{0, 1}* 
smc/i i/ioi /or eac/i i, (Li, <g) is well-ordered, Li, <g) is well-ordered. More- 
over, ifo(Li, <i) < a>i where cti is a limit ordinal for all 1 < i < n, it holds that 

(U ' 

o{[\Li,<i) < max{a l( i)X...xa lW } 

— 7T 

i 

where n ranges over all permutations of {1, . . . , n}. □ 
The next theorem is one of our main results. 

Theorem 4.1 Suppose that X is a nonterminal of height h. Then 

o(X) < oj uh . 



Proof. We prove this claim by induction on the height of X. Let X be 
a nonterminal of height h and suppose that we have proved the claim for all 
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nonterminals of height less than h. Below we will make use of the fact that the 
set of ordinals less than w w is closed under sum and product. Moreover, when 
h > 0, then for every ordinal a < lu^ there is a limit ordinal j3 < lu u with 
a < (3. Indeed, we can choose j3 = a/ 1-1 x n for some n. 

Case 1. X is not recursive. Then, by the induction hypothesis, we have o(Y) < 
lu u whenever Y occurs on the right side of a production whose left side is X. 
It follows by Proposition ^. 91 that o(p) < w u whenever X — > p is a production. 
Since L(X) is a finite union of the languages L(p), it follows that o(X) < , 
by Corollary 14.201 (If h = 0, L(p) is a single word in {0, 1}*. Moreover, o(X) 
is finite.) 

Case 2. X is recursive. Then for each n, x, L(n,x,X) is a finite union of 
languages of the form L(uqx0p), where there is left derivation X u^xOp. 
It is not possible that p contains a nonterminal Y with X w Y, since it that case 
we would have a derivation X u^xQinXq for some terminal word w and 

some q, contradicting Proposition ^. 151 Thus, Y -< X holds for all nonterminals 
Y occurring in p. It follows by the induction hypothesis that o{L(uqX§p), <£ 
) < uj uH . Thus, using Corollary 14.201 it follows that o(L(n, x, X), <Q) < w w . 
Again, by Corollary [4.201 

a„ = o(L(n,X),< e ) 

< «^\ 

(If ft = 0, a n < w, for all n.) But, by Lemma T4. 181 

o(X) = a + ai + . . . 

— sup{^^ ai : n > 0}. 

ie[n] 

Since X)ie[n] tti ^ ^ or au n ' ^ follows that o(X) < uj^ h . □ 

Corollary 4.21 If G is an ordinal grammar, there is an integer n such that 

o(G) < uj u ". 



Proof. Let n be the height of the start symbol S. Then, by Theorem 14.11 
o(5)<w w ". □ 

We have thus completed the proof of this characterization of the ordinals of 
ordinal grammars: 

Theorem 4.2 An ordinal a is less than uj u if and only if there is some ordinal 
grammar G such that a = o(L(G), <t). 
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5 From algebraic trees to prefix grammars 



In this section we show that each system of equations defining an algebraic tree 
can be transformed (in polynomial time) to a prefix grammai0 generating the 
frontier of the tree. This result allows us to complete the proof of the fact that 
every algebraic ordinal is less than u> u . 

Consider a system of equations 

F i {xQ,...,Xn i -x) = ti(x ,...,x ni -i), i = l,...,m, (6) 

where each ti is a term over the ranked alphabet EUJ 7 in the variables xq, . . . , x ni - 
We assume that F\ is the principal function variable and that n\ = 0. Each 
component of the least solution is an algebraic tree. Let (Ti, T 2 , . . . , T m ) denote 
the least solution of the system ([5]). 

For the ranked alphabet E U T , let A be the (unranked) alphabet whose letters 
are the letters (a, k), (Fi, j) where a £ E„, n > and k £ [n], j 6 [n,] and 
1 < i < m. Let T be a finite or infinite tree in Tg u:F (V). 

Definition 5.1 For each vertex u £ dom(T) we define a word u E A* by in- 
duction. First, 'e — e. When u — vi and T{v) — 5, then 5 E (E U F)k, for some 
k > 0, and we define u = v{5, i). 

We define the "labeled frontier language of T E T^ " as the set of words 
Lfr(T) = {uT{u) : T(u) E E } 

We prove the following. 

Theorem 5.1 When T £ T^ is an algebraic tree, Lfr(T) can be generated by a 
prefix grammar^ 

Proof. Suppose that T is the principal component of the least solution of the 
system (|6]). 

We will define a grammar whose nonterminals N consist of the letters Fi, to- 
gether with all ordered pairs (Fi, j) where i = 1, . . . , m, j £ [n*]. 

The grammar will be designed to have the following properties. 

Claim: For any word u, T{(u) — Xj if and only if (Fi,j) u. And for any 

wordu, Ti(u) £ Eo if and only if Fi uTi(u). Moreover, any terminal word 

4 Here, we allow grammars over an arbitrary (linearly ordered) terminal alphabet. The 
notion of a prefix grammar can be adjusted appropriately. 

5 In ICour83 l. Lfr(T) is called the branch language of T. Courcelle showed that a "locally 
finite" tree T is algebraic if and only if Lfr(T) is a strict deterministic context-free language, 
see [Cour78bllCour83| . 
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derivable from (Fi,j) is of the form u, and any terminal word derivable from Fi 
is of the form uTi(u) for some u G dom(Ti). 

Let r = S U {(ct,j) : cr G j £ [k]}. The grammar generating Lfr(F) is: 
G L = (iV,r,P,Fi), where TV = FU {(F, : 1 < i < m,j G [n*]} and the set P 
of productions is defined below. If t±, . . . , t m are the terms on the right side of 
^ above, then the productions are: 



where u 6 dom(t i ) and = ar^, 

F -)• utj(w) 
where u G dom(ii) and ti(u) G S U J 7 . 

The proof of the fact that the above grammar is a prefix grammar generating the 
language Lfr(T) relies on the above claim, and may be found in the Appendix. 

□ 

Example. Suppose the system of equations is: 

F = F x (a) 
Fi(x) = F 2 (a,x) 
F 2 (x,y) = a(x,a,F 2 (x,F 2 (x,y))) 

where the individual variables x, y stand for xq, x\, respectively. Then the 
productions in Gl are: 

F -> F 1 + {F 1 ,0)a 

F t -> F 2 + (F 2 ,0)a 

(F,0) -> (F 2 ,l) 

(F 2 ,0) (a,0) + (a,2)(F 2 ,0) + (a,2)(F 2 ,l)(F 2 ,0) 

(F 2 ,l) -> (a,2)(F 2 ,l)(F 2 ,l) 

F 2 -> (<r,l)a+ (ff,2)F 2 + (o-,2) (F 2 ,1)F 2 



Corollary 5.2 For every system of equations defining an algebraic tree T G T£ 
one can construct in polynomial time a prefix grammar generating the frontier 
ofT. 

Proof. To get from the prefix grammar Gl which derives Lfr(F) to a gram- 
mar G' which derives Fr(T), replace each letter (a,j) by just j, and delete the 



23 



constant symbols in Eo. Thus, in the example above, the productions of G' are 

F -> Fi + (Fi,0) 

Fx -> F 2 + (F 2 ,0) 

(F x ,0) -> (F 2 ,l) 

(F 2 ,0) -> + 2(F 2 ,0) + 2(F 2 ,l)(F 2 ,0) 

(F 2 ,l) -»• 2(F 2 ,1)(F 2 ,1) 

F 2 1 + 2F 2 + 2(F 2 ,1)F 2 . 

It follows that G' is a prefix grammar generating Fr(T). □ 

Corollary 5.3 //a is an algebraic ordinal, there is an ordinal grammar G' with 
a = o(L(G')). 

We may now derive our main theorem. 

Theorem 5.2 An ordinal is algebraic if and only if it is less than oj" . 

We needed only to prove the "only if" direction. But this follows immediately 
from Corollary 15.31 and Theorem 14.21 □ 



6 Conclusion 

We have proved that the algebraic ordinals are exactly those less than w w , or 
equivalently, the ordinals that can be constructed from and 1 by the sum and 
product operations, and the operation a t— > a u . It is known that the regular 
ordinals are those less than u", or equivalently, those that can be constructed 
from 0, 1 and oj by just sum and product; or the ordinals that can be constructed 
from and 1 by sum, product, and the operation a n> a x uj. 

Recall (from [Ros82] . for example) that the HausdorfF rank of a countable 
scattered linear ordering L is the least ordinal a such that L € V a , where 

V := {0,1}, 

and if a > 0, the collection V a is defined by: 

iel /3<a 

where I is either ui, or ui* } the reverse of w, or a finite ordinal n, or uj* + lo. 

In order to characterize the algebraic linear orders, the next step might be 
the characterization of algebraic scattered linear orders. We conjecture that 
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any such linear order has Hausdorff rank less than us u . Moreover, one possible 
conjecture is that, up to isomorphism, these are the linear orders that can be 
constructed from the empty linear order and a one point linear order by the 
sum and product operations, reversal, and the operation P i— > P u , where 



This conjecture is supported by the fact that the scattered regular linear orders 
are exactly those that can be constructed from 0, 1 and lu by the sum and 
product operations and reversal, cf. |Heil80j . Thus, the scattered regular linear 
orders have finite Hausdorff rank, but the converse is false: the linear order 



where Z denotes the linear order of the negative and positive integers is not 
regular, but has finite Hausdorff rank. 

After describing the scattered algebraic linear orders, the next task could be to 
obtain a characterization of all algebraic linear orders. We conjecture that these 
are exactly those linear orders that can be constructed from dense algebraic 
words by substituting a scattered algebraic linear order for each letter. (See 
below for the definition of an algebraic word.) Thus, the task can be reduced 
to the characterization of the dense algebraic words. 

A hierarchy of recursion schemes was studied by [Damm77, Damm82, Gal84, 
Ong07, HMOS07 , and many others. The schemes considered in this paper are 
on the first level of the hierarchy with regular schemes forming level 0. In the 
light of the characterizations of the regular and algebraic ordinals, it is natural 
to conjecture that the ordinals definable on the nth level of the hierarchy are 
those less than 

it (a;, n + 2) =oj hJ ' 

where there are n + 2 cj's altogether. In fact, every ordinal less than -ft- (u>, n + 2) 
is shown to be definable on the nth level in |Braudj . 

In ordinal analysis of logical theories, the strength of a theory is measured by 
ordinals. For example, the proof theoretic ordinal of Peano arithmetic is eo. Here 
we have a similar phenomenon: we measure the strength of recursive definitions 
by ordinals, and we conjecture that the definable ordinals are exactly those less 
than e . 

A generalization of the notion of "finite word" is obtained by considering labeled 
linear orders, where the labels are letters in some finite alphabet. Thus, a linear 
order may be identified with a word on a one letter alphabet. A countable 
word is word whose underlying linear order is countable. A morphism between 
words is a morphism between their respective underlying linear orders that ad- 
ditionally preserves the labeling. Every countable word can be represented as 




Z+l + Z + 2 + Z+ ... + Z + n + Z + ... 
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the word determined by the frontier of a tree where each leaf retains it label, 
cf. |Cour78a] . Now an algebraic word (respectively regular word) is a word 
isomorphic to the frontier word of an algebraic (respectively, regular) tree. An 
"operational" characterization of the regular words was obtained in |Heil80j . 
where it was shown that a nonempty word is regular if and only if it can be 
constructed from single letter words by concatenation, w-power, the "shuffle 
operations" and reversal. (Note that concatenation corresponds to the sum op- 
eration on linear orders, and w-power to the operation P n- P x uj.) Without 
the shuffle operations, exactly the nonempty scattered regular words can be gen- 
erated, and the well-ordered regular words can be generated by concatenation 
and w-power. It would be interesting to obtain operational characterizations of 
well-ordered, scattered, and eventually, all algebraic words. 

Finally, we would like to mention an open problem. Suppose that a context- 
free language L is well-ordered by the lexicographic order. Is the order type of 
(L, <e) less than ui^l 
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Appendix 

This appendix is devoted to a formal proof of the correctness of the translation 
given in Section O 

Consider the system ([6]) whose least solution in T£(V) is (Ti, Ta, . . . , T m ). To 
this system, we can associate the tree grammar Gt whose productions are 

Fi(xo, . . . , 2C n< -i) -» U+-L, i = l,...,m, 

where _L denotes the empty tree. The start symbol is F±. Let V denote the set 
of individual variables that occur in (J6j) . 

Below we will also assume a new individual variable z and write t — t' ★ t" for a 
tree t over £ U T possibly containing variables in V if t' is a tree with a single 
leaf labeled z and t is obtained from t' by replacing this leaf with a copy of t" . 

Definition 7.1 Suppose that t,t' are finite trees in T£(V). Then t t' in 

Gt if t can be written as s* i^(so, . . . , Sn*-l) f° r some trees s and sq, . . . , s n *-i 
such that t' = s*ti(so, ■ ■ ■ , s n< _i) or t' = s*-_L. The relation =!=> is the reflexive 
transitive closure of . 

It is known, cf., |Cour83[ IGue81j . that for any i = 1, . . . , m and for any word 
u, Ti(u) is defined if and only if there is some finite tree t in T£(V) with 
Fi(xo, ■ ■ ■ , Xm—i) ==^ t such that t(u) is defined, and in that case Ti(w) = t(u). 

Thus, it suffices to prove that the grammar Gl defined in Section[5]and the tree 
grammar Gt are related as follows: 
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Lemma 7.2 Let i G {1, . . . , m} and let t be a finite tree T£(V). Suppose that 
Fi(xo, • ■ • , a^ni-i) =^ t- Then for every u G dom(t) and j G [rii], if t(u) = Xj 
then (Fi,j) u, and if tiu) eE n UJ then Fi ut(u). 

Lemma 7.3 Let i G {1, . . . , m} and j G [rij] . 

1. Suppose that (F^ j) ===> to. Then there exist a finite tree t in T£{V) and a 
word u G dom(t) such that t(u) — Xj , w — u and Fi(xo, ■ ■ ■ , x ni -i) ===>■ t. 

2. Suppose that Fi w. Then there exist t and u as above with t(u) G 
So U T, w = ut(u) and Fi(xo, . . . ,x ni -i) => t. 

Proof of Lemma \7.2\ We argue by induction on the length of the derivation. 
When the length is 0, t = Fi(xo, . . . ,x ni -i). If u = j, for some j G [n<], then 
u = (Fi,j) and we clearly have (Fi,j) ==> (F;,j) = u. If u = e then u = e, 
t(e) = ^ and we have Fj =>- F = ut[u). 

In the induction step, assume that the length of the derivation is positive and 
that our claim holds for all derivations of smaller length. Suppose that t(u) is 
a variable Xj or t(u) G So U F. Let us write the derivation as 

Fi(x ,...,x ni -i) =^> t' => i 

where in the last step we have i' = s-kFk(so, . . . , s nfc -i) and t — s-ktk(so, • • • , s n fc -i) 
or i = s -k _L. In the second case, u G dom(i'), moreover u in t' is the same as 
u in t, or as u in s. Moreover, t(u) = s(u) = t'(u). Thus, using the induction 
hypothesis, we obtain (Fi,j) u or Fi ut'(u) = ut(u) according to 

whether t(u) = Xj for some j G [n,] or £(u) G So U J 7 . 

Assume now that t' = s~k Ffc(so, . . . , Sn fe -i) and i = s * £fc(so, . . . , s n(t _i). Let 
denote the unique word with s(vq) = z. There are two cases. If vq is not a 
prefix of u, then we have that u G dom(t') fl dom(s), u in t is the same as u in 
i', and t(u) — s(u) — t'(u). The proof is completed as before. So let vo be a 
prefix of u. If there is some w G dom(tfc) such that u = vqw and ifc(w) is not 
an individual variable, then u in t is wow, where Vo is computed in s and u; is 
computed in t k - Moreover, t(u) = tk(w) G So UF. By the induction hypothesis 
we have Fj =^ WoFfe, and by construction, F& — > wt k {w) is a production. 
Thus, Fj =^ VQwt' (vow) = Vowtk(w) = ut(u). 

Suppose last that it = vouwi, where to G dom(ifc) with tk(w) — x^ for some 
h G [rife] and v\ G dom(s/,). In that case u in i is vqwvi, where vq and w) 
are as before, and v% is computed in Sh- Moreover, t(u) = Sh(vi) = t'(vohv\). 
Assume that tiu) is the individual variable Xj. Then (Fi,j) vo(Fk, h)v\ by 
the induction hypothesis, moreover, (F k ,h) — > w is a production. We conclude 
that (Fi,j) ==> V()WVi = u. Suppose now that tiu) G S U F. Then Fi 
vo{F k ,h)v 1 t\vohv 1 ) = vo(F kl h)v 1 s h ivi) =^ vowvxShivx) =ut(u). □ 
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Proof of Lemma [7.31 Suppose first that (Fi,j) w. ff the length of the 

derivation is 0, our claim is trivial: let t — Fi(xo, ■ ■ • , x ni -i), u — j. We 
proceed by induction. In the induction step, we can write the derivation as 
(Fi,j) ==>■ wo(Fk,h)wi => woqwi = w, where, by the induction hypothesis, 
there exist some tf and Uq,Ui with t'(uohui) — Xj, Wo(Fk,h)wi — u$hui in 
t' and Fi(xo, . . . , x n( -i) =>■ t' . Since (Fk,h) — >• q is a production of Gl, 
there is some p with tfc(p) = Xh and p = g in t k . Clearly, we can write t' 
as t' = s * Fk(so, • . ■ , s nk —i) where s(uo) = z, so that u\ G dom(s^) with 
Sh(ui) — Xj. Now let t = s-ktk(so, . . . , s nfe _i) and consider the word u — v,Qpu\. 
We have that tlu) — s/,(iii) — t'(uohui) = Xj and u = uopui = woqwi = w in 
t. 

Suppose next that Fj ==> w. If the length of the derivation is 0, then w = Fj 
and we take t = Fi(x$, . . . , x ni _i) and u = e. Assume now that the length of 
the derivation is positive and that our claim holds for shorter derivations. We 
can decompose the derivation either as 

F =^> Wo(F k ,h)wi => w qwi = w 

or as 

F, t w F fe =^> wqWx = w. 

The former case is similar to the previous one, so we only deal with the latter. 
In this case, by the induction hypothesis, there is some tree t' and a word 
u G dom(i') with t'(uo) — F&, w — uot'(u ) and Fi(xo, ■ ■ ■ , x ni -i) V '■ 
Moreover, Ffe — > w± is a production and thus wi = Uitfc(tii) in t/. for some 
tti with tk{u\) G En U J. Since t'(uo) — Ft, we can write t' as t' = s -k 
Ffc(so, . . . , s„ fc _i), where s(mq) — z - Now let t = s -k ife(so, ■ • ■ ; s n k ~i) and let 
it = UoU\. We have that Fj(:ro, ■ ■ ■ , x ni -\) ==> t and u> = wqWx = ut(u). 
Moreover, t(u) = t k {ui) G E U T. □ 

We can now prove the claim formulated in the proof of Theorem 15.11 Suppose 
that Ti(u) = Xj for some i, 1 < i < m and j G [n,]. Then (Fj,j) u 
by Lemma EU And if Tj(tt) G So, then (F,,j) mT»(u). Conversely, if 

(Fi,j) w for some terminal word tu, then by Lemma 17.31 either w = u for 
some u with Fj(tt) = Xj, or w = uTi(u) for some w with Tj(u) G Eq. □ 
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